The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task

نویسندگان

  • Cyril Joder
  • Felix Weninger
  • Martin Wöllmer
  • Björn Schuller
چکیده

This paper describes the system proposed for the Spoken Web Search task at Mediaeval 2012 campaign. We use an audio-only system based on our new called Cumulative Dynamic Time Warping (CDTW) algorithm. This algorithm combines the scores of all the alignment paths and allows for the learning of different cost functions for each kind of step in the alignment matrix (diagonal, horizontal and vertical). The results obtained with basic audio descriptors show the promising potential of our algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SWS task: Articulatory phonetic units and sliding DTW

This paper describes the experiments conducted for spoken web search at MediaEval 2011 evaluations. The task consists of searching for audio segments within audio content using an audio query. The current approach uses a broad articulatory phonetic units for indexing the audio files and to obtain audio segments. Sliding DTW is applied on the audio segments to determine the time instants.

متن کامل

BUT2012 Approaches for Spoken Web Search - MediaEval 2012

We submitted two approaches as the required runs: Acoustic Keyword Spotting as the primary one (AKWS) and Dynamic Time Wrapping as the secondary one (DTW) for the Spoken Web Search task. We aimed at building a simple phone based language-dependent system. We experimented with universal context bottle-neck neural network classifier with 3-state phone posterior features or bottle-neck features.

متن کامل

TUKE MediaEval 2012: Spoken Web Search using DTW and Unsupervised SVM

This working paper provides the basic information about experiments conducted on audio documents within the MediaEval 2012 spoken web search evaluation project. The main purpose of these experiments was to build a robust and language independent system for spoken term detection. Therefore we have proposed query-by-example searching system based on the minimum-cost alignment of DTW algorithm and...

متن کامل

The L2F Spoken Web Search System for Mediaeval 2013

The INESC-ID’s Spoken Language Systems Laboratory (LF) primary system developed for the Spoken Web Search task of the Mediaeval 2013 evaluation campaign consists of the fusion of six individual sub-systems exploiting 3 different language-dependent phonetic classifiers. For each phonetic classifier, an acoustic keyword spotting (AKWS) sub-system based on connectionist speech recognition and a dy...

متن کامل

GTTS Systems for the SWS Task at MediaEval 2013

This paper briefly describes the systems presented by the Software Technologies Working Group (http://gtts.ehu.es, GTTS) of the University of the Basque Country (UPV/EHU) to the Spoken Web Search (SWS) task at MediaEval 2013. GTTS systems consist of four main modules: (1) feature extraction; (2) speech activity detection; (3) DTW-based query matching; and (4) score calibration and fusion. The m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012